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1.  Summary 

1 . 1 Project  Goals 

The  purpose  of  this  project  is  to  support  the  ARPA-NMRO  seismic 
data  activity  by  providing  data  storage  and  retrieval  services. 
The  Arpanet  will  be  used  as  the  primary  communications  channel. 

As  part  of  the  service,  seismic  data  will  be  (a)  recieved  from 
tie  Arpanet;  (b)  stored  and  indexed  in  the  Datacomputer ; and 
(c)  made  available  to  computers  on  the  Arpanet  in  a convenient 
and  timely  manner.  These  services  represent  a special  applica- 
tion of  the  Arpanet  Datacomputer  being  developed  and  maintained 
by  Computer  Corporation  of  America  (CCA)  under  Contract  No. 
MDA903-74-C-0225. 

The  amount  of  seismic  data  to  be  stored  requires  the  addition 
of  a mass  memory  to  the  Datacomputer  system.  An  Ampex  Terabit 
Memory  System  (TBM)  with  a capacity  of  almost  two  hundred 
billion  bits  will  be  installed  at  CCA  in  January  1976  to  answer 
this  need.  Modifications  to  CCA's  computer  site  are  necessary 
to  accommodate  the  TBM. 

The  other  hardware  item  vital  to  this  project,  besides  the  TBM, 
is  a small  Seismic  Input  Processor  (SIP).  The  SIP  will  perform 
several  functions,  the  most  important  of  which  is  to  continuously 
collect  data  over  the  network,  reformat  and  buffer  the  data,  and, 
at  intervals,  generate  a datalanguage  update  request  and  burst 
the  data  to  the  Datacomputer  via  the  local  CCA  network  node. 

The  CCA  TIP  has  been  replaced  as  the  local  network  node  with  a 
516  IMP  to  provide  the  required  bandwidth. 

1.2  Technical  Status  of  the  Project 

Project  activity  can  be  divided  into  four  areas:  (1)  SIP 

development  and  network  bandwidth  considerations;  (2) 
coordination  with  the  seismic  community;  (3)  TBM  acquisition 
and  integration  into  the  Datacomputer;  and  (4)  seismic  related 
Datacomputer  development. 


The  SIP  consists  of  a DEC  PDP-11/40  with  28k  core,  two  RP04 
disks  and  an  Arpanet  host  interface  (see  Fig.  1).  All  of  this 
SIP  hardware  has  been  installed,  connected  to  the  network,,  and 
is  working  properly.  Software  development  is  well  underway. 

In  continuing  coordination  with  the  seismic  community,  CCA 
reviewed  and  made  comments  on  the  seismic  file  structures 
transmitted  to  it  by  VSC.  A meeting  was  held  to  further  refine 
these  descriptions. 

The  TBM  memory  system  is  being  built  by  Ampex  Corporation  as  a 
subcontractor  to  CCA.  All  hardware  components  have  been 
fabricated  and  are  undergoing  tests  at  Ampex.  The  initial  TBM 
configuration  will  be  one  transport  driver,  two  dual  transport 
modules,  one  data  channel  and  a Communications  and  Control 
System  (CCS).  Generally  acceptable  software  specifications  for 
the  TBM  have  Deen  received  by  CCA.  Ampex  has  informed  CCA 
that  delivery  of  the  complete  TBM  system  will  be  in  January 
1976. 

A number  of  specific  improvements  have  been  made  in  the  Data- 
computer  and  its  underlying  TENEX  operating  system  for  the 
seismic  application.  These  improvements  appear  to  be  adequate 
for  the  initial  seismic  data  rate.  Further  improvements  have 
been  designed. 


2.  The  SIP 

Seismic  array  data  will  be  collected  from  the  Arpanet,  buffered, 
and  reformatted  by  a small  Seismic  Input  Processor  (SIP)  which 
will  retransmit  the  data  to  the  Datacomputer . The  SIP  is 
equipped  with  disk  storage  adequate  for  24-hour  buffering  of  a 
15  kilobit  per  second  (kbps)  data  stream. 

2.1  The  Software 

The  CCP-SIP  special  protocol  has  been  checked  out  from  the  SIP 
end  and  data  was  received  from  the  CCP  and  stored  in  unparsed 
form  on  the  SIP’s  disk.  This  was  done  with  STPLAB,  a testbed 
produced  by  modifying  a network  interface  test  program. 


A Digital  Equipment  Corporation  (DEC)  RSX-11M  operating  system 
was  installed  on  August  7*  As  expected,  RSX-11M  facilitated 
the  successful  development  of  a Network  Control  Program  (NCP), 
which  has  been  completed,  suitable  for  the  STP.  However, 
attempts  to  integrate  the  rest  of  the  SIP  system  with  the  NCP 
under  RSX-11M  ran  into  problems  in  that  the  RSX  file  system 
takes  4096  more  words  of  memory  than  is  documented.  To 
accommodate  this  would  require  not  only  the  purchase  of  addi- 
tional memory,  but,  in  addition,  since  memory  would  then 
exceed  the  direct  addressing  capability  of  the  SIP  computer, 
memory  management  hardware  and  further  software  to  utilize 
the  memory  management  hardware  would  be  required.  One  solution 
to  this  problem  is  to  proceed  with  the  integration  of  the  SIP 
by  replacing  the  RSX-11M  operating  system  functions  used  by  the 
NCP  with  functions  already  Implemented  for  the  portion  of  the 
SIP  that  communicated  with  the  CCP. 


2.2  IMP-TIP  Swap 

On  September  15,  the  CCA-IMP  was  replaced  by  a 516  IMP.  A 
30%  Improvement  was  expected  with  a 316  IMP  after  removing  the 
TIP  load;  however,  about  a factor  of  two  improvement  was  found 
due  to  the  516  being  a faster  computer  than  the  316.  Thus  the 
local  bandwidth  problem  has  been  solved  for  the  initial  seismic 
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data  rates.  Long  distance  network  traffic  !s  more  sensitive 
to  buffer  availability  and  may  still  present  a problem  unless 
the  Lincoln  Laboratories'  VDH  is  moved  to  another  IMP.  Bolt 
Beranek  and  Newman  Inc  (BBN)  is  also  working  on  reducing  the 
overhead  in  the  network  associated  with  routing  messages. 

Since  several  TIP  lines  were  in  use  at  CCA,  one  as  a simple 
means  of  communicating  with  the  SIP  from  CCA-TENEX,  some  way 
of  supplying  more  terminal  lines  at  CCA  was  necessary. 
Increasing  the  number  of  terminal  lines  on  the  CCA-TENEX  line 
scanner  by  adding  a DC-10B  eight  line  group  seems  to  be  the 
best  way  to  do  this. 

2.3  The  Hardware 

The  hardware  complement  for  the  SIP  is  complete  except  that  a 
permanent  status  display  has  not  been  procured.  A 96OO  baud 
terminal  is  currently  being  used  as  a status  display.  No 
significant  hardware  problems  have  arisen  this  quarter. 


3-  Coordination  with  Seismic  Community 

In  continuing  coordination  with  the  seismic  community,  CCA 
reviewed  the  seismic  file  descriptions  communicated  to  it  by 
VSC,  sending  comments  back  to  VSC  and  participating  in  a meeting 
concerning  the  file  descriptions  as  described  below.  Various 
tests  of  the  descriptions  after  this  meeting  revealed  a large 
number  of  minor  problems,  many  related  to  byte  sizes,  that  have 
been  fixed  by  very  minor  changes  in  the  descriptions . Most  of 
these  byte  size  problems  ultimately  stem  from  attempting  to 
manipulate  8,  16  and  32  bit  bytes  with  maximum  efficiency  using 
hardware  byte  facilities  that  are  sensitive  to  a 36  bit  word 
size.  In  considering  the  difficulty  of  perfecting  the  seismic 
file  descriptions,  one  should  keep  in  mind  that  they  represent 
at  least  fifteen  different  types  of  files,  some  with  descrip- 
tions more  than  a page  long  as  compared  with  the  largest  pre- 
vious similar  application  of  the  Datacomputer  (the  Arpanet  real 
time  survey  files  produced  by  MIT-DMS)  that  involved  two 
different  types  of  files  each  with  a description  less  than 
10  lines  long. 

3.1  Comments  on  Seismic  File  Descriptions 
CCA  reviewed  the  seismic  file  formats  submitted  by  VSC  and 
sent  a memo  to  VSC  with  suggestions  for  possible  improvements. 
Some  of  these  suggestions  related  to  efficiency,  some  to  recent 
and  imminent  extensions  to  the  Datacomputer  that  might  be  of 
use  in  the  seismic  application,  and  some  to  clarification  of 
minor  points  in  the  organization  of  the  files. 

For  efficiency  reasons,  it  was  suggested  that  fewer  inversions 
be  used  and  pointed  out  that  some  storage  was  being  wasted  by 
certain  byte  size  specifications.  It  was  also  pointed  out  that 
for  the  high  bandwidth  raw  data  files,  it  would  excessively 
degrade  bandwidth,  due  to  the  computational  effort  involved, 
to  use  inversions  or  large  amounts  of  data  convers.  on  and  byte 
realignment  and  padding.  Uniform  use  of  8 and  16  bit  bytes 
was  urged. 
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Features  suggested  for  use  in  the  seismic  files  included 
virtual  indexes,  which  are  more  general  than  inversions  in 
some  way  (see  Datacomputer  User  Manual),  the  recently 
introduced  ability  to  give  the  Datacomputer  information  about 
the  expected  size  of  a file  (as  well  as  minimum  and  maximum 
limits),  and  a minor  generalization  of  the  inversion  feature. 
It  was  suggested  that  some  of  the  multi-file  coordination 
problems  of  the  seismic  application  may  be  helped  by  the  file 
group  feature  planned  for  the  Datacomputer. 

Finally  some  suggestions  for  the  directory  organization  of 
the  seismic  files  were  made  along  with  a number  of  minor 
specific  changes  to  certain  of  the  files. 

3.2  Meeting  on  Seismic  File  Formats 
A meeting  was  held  in  Alexandria,  Virginia  on  September  11 
to  try  to  optimize  and  settle  on  seismic  file  descriptions 
and  a directory  organization  fo.  them.  A number  of  minor 
changes  were  made  in  the  file  descriptions  to  fix  problems 
or  enhance  capabilities  and  the  following  major  decisions 
were  made: 


(1)  The  previous  "array"  raw  data  files  v/ill  be  broken 
up  into  separate  files  by  site.  This  will  make  it  easier  to 
add  and  delete  sites  and  make  each  site  file  much  simpler  and 
easier  to  manipulate  in  an  optimum  manner  than  the  composite 
file  would  have  been. 

(2)  To  take  advantage  of  the  hierarchical  file  directory 
of  the  Datacomputer,  the  pathnames  of  the  seismic  files  will 
be  split  into  type,  site  (which  applicable),  year,  month,  and 
day  (except  for  lower  data  rate  files  that  are  divided  by 
month) . 
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(3)  Virtual  indexes  were  extensively  inserted  to  speed 
retrieval  of  data  by  being  able  to  specify  physical  positions 
in  files.  Physical  position,  for  many  seismic  files,  corres- 
ponds directly  to  the  real  time  of  a data  item. 


4 . The  TBM 

An  Ampex  Terabit  Memory  System  (TBM)  will  be  part  of  the  Data- 
computer  to  provide  the  required  large  amount  of  on-line  storage. 
The  TBM  requires  site  modifications  at  CCA. 

4.1  Software  Specifications 

New  software  specifications  for  the  TBM  have  been  received 
from  Ampex.  These  specifications  are  generally  acceptable  for 
the  initial  TBM  installation  at  CCA.  There  are  still  a few 
minor  problems,  particularly  regarding  the  file-id  number  to 
be  recorded  in  the  tally  track  for  each  block  and  used  for 
error  checking. 

The  new  software  specifications  are,  however,  inadequate 
for  possible  future  expansions  of  the  TBM.  More  dual  transport 
units  can  be  added  and  more  TBM  internal  data  channels  for 
redundancy  (only  one  channel  to  be  used  at  a time)  can  be  added 
easily.  But,  to  utilize  additional  transport  drivers  or  CIU's 
(for  multiple  simultaneous  data  transfers)  would  require  some 
changes  in  the  software  specifications  and,  according  to  Ampex, 
extensive  enhancements  to  the  internal  TBM  software.  These 
matters  were  discussed  by  CCA  and  Ampex  personnel  at  CCA  on 
October  2 and  3. 

4.2  Site  Preparation 

Agreement  has  been  reached  on  all  particulars  with  Control 
Data  Corporation  (CDC),  our  general  contractor,  for  the  site 
preparation  work  necessary  for  TBM  installation.  Although  all 
of  the  work,  including  installation  of  such  new  heavy  equipment 
as  air  conditioning  and  electric  supply  transforner  will  take 
several  weeks,  it  has  been  determined  that  the  most  disruptive 
work  can  be  concentrated  into  one  week. 
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The  moving,  replacing,  and  cutting  of  wall,  ceiling  and  false 
floor  elements,  generating  metallic  particles  and  filter 
penetrating  plaster  dusk,  are  the  most  disruptive  and 
potentially  damaging  of  the  planned  site  work.  This  sort  of 
work  will  all  be  concentrated  in  one  week  during  which  time 
all  computer  equipment  in  the  area  will  be  turned  off  and 
after  which  the  area  will  be  thoroughly  cleaned  before  any 
equipment  is  turned  on  again.  This  week  of  down  time  for 
site  preparation  will  interfere  with  other  aspects  of  this 
project  and  other  CCA  activities. 

^.3  The  Hardware 

All  of  the  hardware  components  of  the  TBM,  as  described  in 
the  last  quarterly  technical  report , are  fabricated  and  have 
been  integrated.  They  are  undergoing  tests  at  Ampex. 


5.  The  Datacomputer 

Experimentation  with  the  type  of  requests  that  the  SIP  will 
make  to  insert  information  in  place  in  seismic  array  files 
revealed  several  inefficiencies  that  had  not  previously  been 
noticed  in  dealing  with  smaller  files.  After  removal  of  the 
worst  of  these  inefficiencies,  it  was  demonstrated  that  seismic 
data  could  be  stored  faster  than  the  60  kbps  which  appears 
adequate  for  now. 

Two  modifications  were  made  to  the  Datacomputer ’ s underlying 
TENEX  operating  syste.a  in  anticipation  of  software  integration 
of  the  TBM.  First,  track-at-a-time  input  from  and  output  to 
the  large-scale  disk  to  be  used,  in  some  cases,  for  staging  of 
TBM  data  was  implemented.  This  represents  approximately  a 
three-fold  increase  in  bandwidth  over  the  record-at-a-time  disk 
I/O  that  had  previously  been  the  only  type  available.  Second, 
TENEX  was  modified  so  as  to  be  able  to  use  more  than  262,144 
words  of  main  memory.  More  than  this  previous  memory  limit 
has  been  installed  and  is  necessary  for  TBM  buffers. 

Modifications  to  the  Datacomputer' s underlying  TENEX  operating 
system  have  been  designed  to  allow  efficient  handling  of  32 
bit  network  connections.  Previously  it  had  been  assumed  that 
36  bit  (the  Datacomputer ' s word  size)  network  data  connections 
with  4 bits  of  padding  per  36  bits  would  have  to  be  used 
between  the  SIP  and  the  Datacomputer.  The  seismic  data  is 
fundamentally  structured  in  16  bit  units.  If  this  modification 
is  implemented  and  proves  workable,  an  approximate  10%  overall 
saving  in  bandwidth  between  the  SIP  and  the  Datacomputer  will 
be  made. 
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DATE  ClCCIVCD 


TYPfn  NAME 
AND  Oi  FICE 


SIGNATURE  Or  A'JTH  GOVT  KWP 


* If  q iznhiy  icccivcd  by  ll*e  Govern 
quantity  jlupr*J,  ird'Cdh  by  ( (/ 


*«f  rt  lie  tone  at 

tvu","F  »"'|'r.-v,  H'M>;nr  wy  « \r  I m 1rl,  »f  dtf' 
leipnf,  enter  actual  quunMy  received  b^la*  quantity 
shipped  ond  encircle.  


23.  CONTRACTOR  USE  ONLY 
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